Statistical analysis of the autoregressive modeling of reverberant speech.

نویسندگان

  • Nikolay D Gaubitch
  • Darren B Ward
  • Patrick A Naylor
چکیده

Hands-free speech input is required in many modern telecommunication applications that employ autoregressive (AR) techniques such as linear predictive coding. When the hands-free input is obtained in enclosed reverberant spaces such as typical office rooms, the speech signal is distorted by the room transfer function. This paper utilizes theoretical results from statistical room acoustics to analyze the AR modeling of speech under these reverberant conditions. Three cases are considered: (i) AR coefficients calculated from a single observation; (ii) AR coefficients calculated jointly from an M-channel observation (M > 1); and (iii) AR coefficients calculated from the output of a delay-and sum beamformer. The statistical analysis, with supporting simulations, shows that the spatial expectation of the AR coefficients for cases (i) and (ii) are approximately equal to those from the original speech, while for case (iii) there is a discrepancy due to spatial correlation between the microphones which can be significant. It is subsequently demonstrated that at each individual source-microphone position (without spatial expectation), the M-channel AR coefficients from case (ii) provide the best approximation to the clean speech coefficients when microphones are closely spaced (<0.3m).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical trend analysis and forecast modeling of air pollutants

The study provides a statistical trend analysis of different air pollutants using Mann-Kendall and Sen’s slope estimator approach on past pollutants statistics from air quality index station of Varanasi, India. Further, using autoregressive integrated moving average model, future values of air pollutant levels are predicted. Carbon monoxide, nitrogen dioxide, sulphur dioxide, particu...

متن کامل

Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation

This paper presents a new statistical model for speech signals, which consists of a time-invariant dictionary incorporating a set of the power spectral densities of excitation signals and a set of all-pole filters where the gain of each pair of excitation and filter elements is allowed to vary over time. We use this model to develop a combined blind separation and dereverberation method for spe...

متن کامل

Feature and distribution normalization schemes for statistical mismatch reduction in reverberant speech recognition

Reverberant noise has been a major concern in speech recognition systems. Many speech recognition systems, even with state-of-art features, fail to respond to reverberant effects and the recognition rate deteriorates. This paper explores the significance of normalization strategies in reducing statistical mismatches for robust speech recognition in reverberant environment. Most normalization wo...

متن کامل

Modified Maximum Likelihood Estimation in First-Order Autoregressive Moving Average Models with some Non-Normal Residuals

When modeling time series data using autoregressive-moving average processes, it is a common practice to presume that the residuals are normally distributed. However, sometimes we encounter non-normal residuals and asymmetry of data marginal distribution. Despite widespread use of pure autoregressive processes for modeling non-normal time series, the autoregressive-moving average models have le...

متن کامل

Investigating the formal effect of rear wall structure on acoustic parameters of speech halls (Research Article)

Referring to the rear wall in a hall is the furthest element rather than the voice source, therefor the reflections of this structural member play important role in music and speech intelligibly, especially for one-third behind audiences. Hence the form of these structures can be very effective in the acoustical quality of speech halls and auditoria. In this study, four formic structures are ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • The Journal of the Acoustical Society of America

دوره 120 6  شماره 

صفحات  -

تاریخ انتشار 2006